Sequence listing

(1) Name of applicant: JAPAN TOBACCO INC.

(2) Title of the invention: Cell Surface Molecule Mediating Cell Adhesion and Signal Transmission

(3) Reference number: J1-802PCT

(4) Application number:

(5) Filing date:

(6) Country where the priority application was filed and the application number of the application:

Japan, No. Hei 9-062290

Japan, Patent Application filed on February 26, 1998 (reference number, J1-802DP1)

(7) Priority date: February 27, 1997

(8) Number of sequences: 12

SEQ ID NO: 1:
SEQUENCE LENGTH: 600
SEQUENCE TYPE: nucleic acid
TOPOLOGY: linear
MOLECULE TYPE: cDNA to mRNA
ORIGINAL SOURCE:
ORGANISM: Homo sapiens
FEATURE:
NAME/KEY: CDS
LOCATION: 1 .. 600
IDENTIFICATION METHOD: E
SEQUENCE DESCRIPTION: SEQ ID NO: 1

ATG AAG TCA GGC CTC TGG TAT TTC TTT CTC 30
Met Lys Ser Gly Leu Trp Tyr Phe Phe Leu
1               5                   10
TTC TGC TTG CGC ATT AAA GTT TTA ACA GGA 60
Phe Cys Leu Arg Ile Lys Val Leu Thr Gly
                15                  20
GAA ATC AAT GGT TCT GCC AAT TAT GAG ATG 90
Glu Ile Asn Gly Ser Ala Asn Tyr Glu Met
                25                  30
TTT ATA TTT CAC AAC GGA GGT GTA CAA ATT 120
Phe Ile Phe His Asn Gly Gly Val Gln Ile
                35                  40
TTA TGC AAA TAT CCT GAC ATT GTC CAG CAA 150
Leu Cys Lys Tyr Pro Asp Ile Val Gln Gln
                45                  50
TTT AAA ATG CAG TTG CTG AAA GGG GGG CAA 180
Phe Lys Met Gln Leu Leu Lys Gly Gly Gln
                55                  60
ATA CTC TGC GAT CTC ACT AAG ACj AAA GGA 210
Ile Leu Cys Asp Leu Thr Lys Thr Lys Gly
                65                  70
AGT GGA AAC ACA GTG TCC ATT AAG AGT CTG 240
Ser Gly Asn Thr Val Ser Ile Lys Ser Leu
                75                  80
AAA TTC TGC CAT TCT CAG TTA TCC AAC AAC 270
Lys Phe Cys His Ser Gln Leu Ser Asn Asn
                85                  90
AGT GTC TCT TTT TTT CTA TAC AAC TTG GAC 300
Ser Val Ser Phe Phe Leu Tyr Asn Leu Asp
                95                  100
CAT TCT CAT GCC AAC TAT TAC TTC TGC AAC 330
His Ser His Ala Asn Tyr Tyr Phe Cys Asn
                105                 110
CTA TCA ATT TTT GAT CCT CCT CCT TTT AAA 360
Leu Ser Ile Phe Asp Pro Pro Pro Phe Lys
                115                 120
GTA ACT CTT ACA GGA GGA TAT TTG CAT ATT 390
Val Thr Leu Thr Gly Gly Tyr Leu His Ile
                125                 130
TAT GAA TCA CAA CTT TGT TGC CAG CTG AAG 420
Tyr Glu Ser Gln Leu Cys Cys Gln Leu Lys
                135                 140
TTC TGG TTA CCC ATA GGA TGT GCA GCC TTT 450
Phe Trp Leu Pro Ile Gly Cys Ala Ala Phe
                145                 150
GTT GTA GTC TGC ATT TTG GGA TGC ATA CTT 480
Val Val Val Cys Ile Leu Gly Cys Ile Leu
                155                 160
ATT TGT TGG CTT ACA AAA AAG AAG TAT TCA 510
Ile Cys Trp Leu Thr Lys Lys Lys Tyr Ser
                165                 170
TCC AGT GTG CAC GAC CCT AAC GGT GAA TAC 540
Ser Ser Val His Asp Pro Asn Gly Glu Tyr
                175                 180
ATG TTC ATG AGA GCA GTG AAC ACA GCC AAA 570
Met Phe Met Arg Ala Val Asn Thr Ala Lys
                185                 190
AAA TCT AGA CTC ACA GAT GTG ACC CTA TAA 600
Lys Ser Arg Leu Thr Asp Val Thr Leu
                195



SEQUENCE ID NO: 2:
SEQUENCE LENGTH: 199
SEQUENCE TYPE: amino acid
TOPOLOGY: linear
MOLECULE TYPE: protein
ORIGINAL SOURCE:
ORGANISM: Homo sapiens
SEQUENCE DESCRIPTION: SEQ ID NO: 2

Met Lys Ser Gly Leu Trp Tyr Phe Phe Leu
1               5                   10
Phe Cys Leu Arg Ile Lys Val Leu Thr Gly
                15                  20
Glu Ile Asn Gly Ser Ala Asn Tyr Glu Met
                25                  30
Phe Ile Phe His Asn Gly Gly Val Gln Ile
                35                  40
Leu Cys Lys Tyr Pro Asp Ile Val Gln Gln
                45                  50
Phe Lys Met Gln Leu Leu Lys Gly Gly Gln
                55                  60
Ile Leu Cys Asp Leu Thr Lys Thr Lys Gly
                65                  70
Ser Gly Asn Thr Val Ser Ile Lys Ser Leu
                75                  80
Lys Phe Cys His Ser Gln Leu Ser Asn Asn
                85                  90
Ser Val Ser Phe Phe Leu Tyr Asn Leu Asp
                95                  100
His Ser His Ala Asn Tyr Tyr Phe Cys Asn
                105                 110
Leu Ser Ile Phe Asp Pro Pro Pro Phe Lys
                115                 120
Val Thr Leu Thr Gly Gly Tyr Leu His Ile
                125                 130
Tyr Glu Ser Gln Leu Cys Cys Gln Leu Lys
                135                 140
Phe Trp Leu Pro Ile Gly Cys Ala Ala Phe
                145                 150
Val Val Val Cys Ile Leu Gly Cys Ile Leu
                155                 160
Ile Cys Trp Leu Thr Lys Lys Lys Tyr Ser
                165                 170
Ser Ser Val His Asp Pro Asn Gly Glu Tyr
                175                 180
Met Phe Met Arg Ala Val Asn Thr Ala Lys
                185                 190
Lys Ser Arg Leu Thr Asp Val Thr Leu
                195


SEQUENCE ID NO: 3:
SEQUENCE LENGTH: 2610
SEQUENCE TYPE: nucleic acid
TOPOLOGY: linear
MOLECULE TYPE: other nucleic acids (cDNA containing 3'- and 5'-untranslated sequences)
ORIGINAL SOURCE:
ORGANISM: Homo sapiens
FEATURE:
NAME/KEY: CDS
LOCATION: 26 .. 625
IDENTIFICATION METHOD: E
SEQUENCE DESCRIPTION: SEQ ID NO: 3

GGACTGTTAA CTGTTTCTGG CAAAC 25
ATG AAG TCA GGC CTC TGG TAT TTC TTT CTC 55
Met Lys Ser Gly Leu Trp Tyr Phe Phe Leu
1               5                   10
TTC TGC TTG CGC ATT AAA GTT TTA ACA GGA 85
Phe Cys Leu Arg Ile Lys Val Leu Thr Gly
                15                  20
GAA ATC AAT GGT TCT GCC AAT TAT GAG ATG 115
Glu Ile Asn Gly Ser Ala Asn Tyr Glu Met
                25                  30
TTT ATA TTT CAC AAC GGA GGT GTA CAA ATT 145
Phe Ile Phe His Asn Gly Gly Val Gln Ile
                35                  40
TTA TGC AAA TAT CCT GAC ATT GTC CAG CAA 175
Leu Cys Lys Tyr Pro Asp Ile Val Gln Gln
                45                  50
TTT AAA ATG CAG TTG CTG AAA GGG GGG CAA 205
Phe Lys Met Gln Leu Leu Lys Gly Gly Gln
                55                  60
ATA CTC TGC GAT CTC ACT AAG ACj AAA GGA 235
Ile Leu Cys Asp Leu Thr Lys Thr Lys Gly
                65                  70
AGT GGA AAC ACA GTG TCC ATT AAG AGT CTG 265
Ser Gly Asn Thr Val Ser Ile Lys Ser Leu
                75                  80
AAA TTC TGC CAT TCT CAG TTA TCC AAC AAC 295
Lys Phe Cys His Ser Gln Leu Ser Asn Asn
                85                  90
AGT GTC TCT TTT TTT CTA TAC AAC TTG GAC 325
Ser Val Ser Phe Phe Leu Tyr Asn Leu Asp
                95                  100
CAT TCT CAT GCC AAC TAT TAC TTC TGC AAC 355
His Ser His Ala Asn Tyr Tyr Phe Cys Asn
                105                 110
CTA TCA ATT TTT GAT CCT CCT CCT TTT AAA 385
Leu Ser Ile Phe Asp Pro Pro Pro Phe Lys
                115                 120
GTA ACT CTT ACA GGA GGA TAT TTG CAT ATT 415
Val Thr Leu Thr Gly Gly Tyr Leu His Ile
                125                 130
TAT GAA TCA CAA CTT TGT TGC CAG CTG AAG 445
Tyr Glu Ser Gln Leu Cys Cys Gln Leu Lys
                135                 140
TTC TGG TTA CCC ATA GGA TGT GCA GCC TTT 475
Phe Trp Leu Pro Ile Gly Cys Ala Ala Phe
                145                 150
GTT GTA GTC TGC ATT TTG GGA TGC ATA CTT 505
Val Val Val Cys Ile Leu Gly Cys Ile Leu
                155                 160
ATT TGT TGG CTT ACA AAA AAG AAG TAT TCA 535
Ile Cys Trp Leu Thr Lys Lys Lys Tyr Ser
                165                 170
TCC AGT GTG CAC GAC CCT AAC GGT GAA TAC 565
Ser Ser Val His Asp Pro Asn Gly Glu Tyr
                175                 180
ATG TTC ATG AGA GCA GTG AAC ACA GCC AAA 595
Met Phe Met Arg Ala Val Asn Thr Ala Lys
                185                 190
AAA TCT AGA CTC ACA GAT GTG ACC CTA TAA 625
Lys Ser Arg Leu Thr Asp Val Thr Leu
                195
TATGGAACTC TGGCACCCAG GCATGAAGCA CGTTGGC( TTTTCCTCAA 675
CTTGAAGTGC AAGATTCTCT TATTTCCGGG ACCACGGA( GTCTGACTTA 725
ACTACATACA TCTTCTGCTG GTGTTTTGTT CAATCTGGAA GAATGACTGT 775
ATCAGTCAAT GGGGATTTTA ACAGACTGCC TTGGTACTGC KJAGTCCTCT 825
CAAAACAAAC ACCCTCTTGC AACCAGCTTT GGAGAAAGCC CAGCTCCTGT 875
GTGCTCACTG GGAGTGGAAT CCCTGTCTCC ACATCTGCTC CTAGCAGTGC 925
ATCAGCCAfiT AAAACAAACA CATTTACAAG AAAAATGTTT TAAAGATGCC 975
AGGGGTACTI AATCTGCAAA GCAAATGAGC AGCCAAGGAC CAGCATCTGT 1025
CCGCATTTCA CTATCATACT ACCTCTTCTT TCTGTAGGGR TGAGAATTCC 1075
TCTTTTAATC lGTCAAGGGA GATGCTTCAA AGCTGGRGCT ATTTTATTTC 1125
TGAGATGTTG ATGTGAACTG TACATTAGTA CATACTCAGT ACTCTCCTTC 1175
AATTGCTGAA CCCCAGTTGA CCATTTTACC AAGACTTTAG ATGCTTTCTT 1225
GTGCCCTCAA TTTTCTTTTT AAAAATACTT CTACATGACT GCTTGACAGC 1275
CCAACAGCCA CTCTCAATAG AGAGCTATGT CTTACATTCT TTCCTCTGCT 1325
GCTCAATAGT TTTATATATC TATGCATACA TATATACACA CATATGTATA 1375
TAAAATTCAT AATGAATATA TTTGCCTATA TTCTCCCTAC AAGAATATTT 1425
TTGCTCCAGA AAGACATGTT CTTTTCTCAA ATTCAGTTAA AATGGTTTAC 1475
TTTGTTCAAG TTAGTGGIAG GAAACATTGC CCGGAATTGA AAGCAAATTT 1525
AWWTTATTAT CCTATTTTGT ACCATTATCT ATGTTTTCAT GGTGCTATTA 1575
ATTACAAGTT TAGTTCTTT TGTAGATCAT ATTAAAATTG CAAACAAAAT 1625
CATCTTTAAT GGGCCAGCAT JCTCATGGGG TAGAGCAGAA TATTCATTTA 1675
GCCTGAAAGC TGCAGTTACT JAGGTTGCT GTCAGACTAT ACCCATGGTG 1725
CCTCTGGGCT TGACAGGTCA JGGTCCC CATCAGCCTG GAGCAGCCCT 1775
CCAGACCTGG GTGGAATTCC AG&GTTGAGA GACTCCCCTG AGCCAGAGGC 1825
CACTAGGTAT TCTTGCTCCC AGAMCTGAA GTCACCCTGG GAATCACAGT 1875
GGTCTACCTG CATTCATAAT TCCASGATCT GTGAAGAGCA CATATGTGTC 1925
AGGGCACAAT TCCCTCTCAT AAAAAOCACA CAGCCTGGAA ATTGGCCCTG 1975
GCCCTTCAAG ATAGCCTTCT TTAGAATATG ATTTGGCTAG AAAGATTCTT 2025
AAATATGTGG AATATGATTA TTCTTAGOyTG GAATATTTTC TCTACTTCCT 2075
GTCTGCATGC CCAAGGCTTC TGAAGCAGI AATGTCGATG CAACAACATT 2125
TGTAACTTTA GGTAAACTGG GATTATGTTI TAGTTTAACA TTTTGTAACT 2175
GTGTGCTTAT AGTTTACAAG TGAGACCCGA ATGTCATTA TGCATACTTA 2225
TATTATCTTA AGCATGTGTA ATGCTGGATG TWAGTAC AGTACWTAAC 2275
TTGTAATTTG AATCTAGTAT GGTGTTCTGT TTTCAGCTGA CTTGGACAAC 2325
CTGACTGGCT TTGCACAGGT GTTCCCTGAG TTGTTTGCAG GTTTCTGTGT 2375
GTGGGGTGGG GTATGGGGAG GAGAACCTTC ATGGTGGCCC ACCTGGCCTG 2425
GTTGTCCAAG CTGTGCCTCG ACACATCCTC ATCCCMGCA TGGGACACCT 2475
CAAGATGAAT AATAATTCAC AAAATTTCTG TGAAATCl TCCAGTTTTA 2525
AGAGGAGCCA CTTATCAAAG AGATTTTAAC AGTAGTAAISA AGGCAAAGAA 2575
TAAACATTTG ATATTCAGCA ACTGAAAAAA AAAAA 2610

SEQ ID NO: 4:
SEQUENCE LENGTH: 2072
SEQUENCE TYPE: nucleic acid
TOPOLOGY: linear
MOLECULE TYPE: other nucleic acids (cDNA containing 3'- and 5'-untranslated sequences)
ORIGINAL SOURCE:
ORGANISM: Rattus
FEATURE:
NAME/KEY: CDS
LOCATION: 35 .. 637
IDENTIFICATION METHOD: E
SEQUENCE DESCRIPTION: SEQ ID NO: 4

CTGGAGGGGA AGAGTGCAGC TGTTCCTGGC AGAC 34
ATG AAG CCC TAC TTC TCG TGC GTC TTT GTC 64
Met Lys Pro Tyr Phe Ser Cys Val Phe Val
1               5                   10
TTC TGC TTC CTA ATC AAA CTT TTA ACA GGA 94
Phe Cys Phe Leu Ile Lys Leu Leu Thr Gly
                15                  20
GAA CTC AAT GAC TTG GCC AAT CAC AGG ATG 124
Glu Leu Asn Asp Leu Ala Asn His Arg Met
                25                  30
TTT TCG TTT CAC GAT GGA GGT GTA CAG ATT 154
Phe Ser Phe His Asp Gly Gly Val Gln Ile
                35                  40
TCT TGT AAC TAC CCT GAG ACT GTC CAG CAG 184
Ser Cys Asn Tyr Pro Glu Thr Val Gln Gln
                45                  50
TTA AAA ATG CAG TTG TTC AAA GAC AGA GAA 214
Leu Lys Met Gln Leu Phe Lys Asp Arg Glu
                55                  60
GTC CTC TGC GAC CTC ACC AAG ACC AAG GGA 244
Val Leu Cys Asp Leu Thr Lys Thr Lys Gly
                65                  70
AGC GGA AAC ACC GTG TCC ATC AAG AAT CCG 274
Ser Gly Asn Thr Val Ser Ile Lys Asn Pro
                75                  80
ATG TCC TGT CCA TAT CAG CTG TCC AAC AAC 304
Met Ser Cys Pro Tyr Gln Leu Ser Asn Asn
                85                  90
AGT GTC TCT TTT TTC CTA GAC AAC GCA GAC 334
Ser Val Ser Phe Phe Leu Asp Asn Ala Asp
                95                  100
AGC TCC CAG GGC AGC TAC TTT TTA TGC AGC 364
Ser Ser Gln Gly Ser Tyr Phe Leu Cys Ser
                105                 110
CTG TCG ATT TTC GAC CCA CCC CCT TTT CAA 394
Leu Ser Ile Phe Asp Pro Pro Pro Phe Gln
                115                 120
GAA AAG AAC CTT AGT GGA GGA TAT TTG CTT 424
Glu Lys Asn Leu Ser Gly Gly Tyr Leu Leu
                125                 130
ATT TAT GAA TCC CAG CTT TGT TGC CAG CTG 454
Ile Tyr Glu Ser Gln Leu Cys Cys Gln Leu
                135                 140
AAG CTT TGG TTA CCC GTA GGG TGT GCA GCT 484
Lys Leu Trp Leu Pro Val Gly Cys Ala Ala
                145                 150
TTT GTG GCA GCG CTC CTT TTT GGA TGC ATA 514
Phe Val Ala Ala Leu Leu Phe Gly Cys Ile
                155                 160
TTT ATC GTC TGG TTT GCA AAA AAG AAG TAC 544
Phe Ile Val Trp Phe Ala Lys Lys Lys Tyr
                165                 170
AGA TCC AGT GTG CAC GAC CCT AAT AGC GAG 574
Arg Ser Ser Val His Asp Pro Asn Ser Glu
                175                 180
TAC ATG TTC ATG GCG GCA GTC AAC ACA AAC 604
Tyr Met Phe Met Ala Ala Val Asn Thr Asn
                185                 190
AAA AAG TCC AGA CTT GCA GGT ATG ACC TC/ 634
Lys Lys Ser Arg Leu Ala Gly Met Thr Ser
                195                 200
TAA 637
TCTGGAACAC GGGAACCCAT GGAGGAACTA CACTGTCTAG TTCCCCTGAA 687
ACTTGAATGG AGAAAGTCTT CTATTTTCTG GACCACAGGG CATCTGACTT 737
GATTAACTAK TGATACCTCC TTTTGGKGTT TTGTTTGTCT GGATCAGTGA 787
CTATCAGTCM CTCGGAATTT CAGCAGACTG CCCTGGGTTT GCTGAGTCCT 837
TTTAAGGCAA ACCCCTTCTT ATAGAAGACC CGGCTCATAT GTATTCAACA 887
AACAGACCTC ACTGGGATAC AATCCCCTCT TTCTGCGCCT GCTTCTAGCT 937
ATGCACCGGC CAGCAAGACA AACATATCTC CAGCATTTTT ACAAAAATGC 987
CAGGGTATGA ATCTGTAAAG TACACAGGCA GCCATTGACC ACCGTCTGTC 1037
CTCGTTTTTT CAGATTCTAT TTTTTTCCAT AGAGATCAGC ATTCCTTCTA 1087
GAATCAGACA GTAGAGGGAG ATGCTTCACA ACAGAAGCTC TTATGTTTCT 1137
GAGATGTTGA TGAATTCATG CTTTAGTACC ACCATGTTCT CTAACAACTT 1187
CTATATTCCA GCTGATCACT GCTTCAGGGC TTAGATGCCT GCTTTTGCCT 1237
TCAAGTCTCC CCTTAAAGAT ACTCCCACAG GTCTACTTGG TGGCCTGCAG 1287
CCACTCTGAA TAGGAAGTTT GGTCTACAAT TTCCCCCCTC TGCTGCTCAA 1337
AAAAAAAAAT TAGTAGATAT NjATTTTCCCA TATTCTCCCT GCCAAAGTAA 1387
TTTTTTCCAG CAAAGACATC TMATTCAGT TAATATGGTT TACTGTGTTG 1437
ATATTAGTGG CAGTAAACAT TTCTCAGAAT CAAAAGCAAA TTAATTTTGC 1487
GGTGGTGTTT TTCTACCATT ATCTTGGGTT TCCATGGTGC TATTACTCAC 1537
AAGTTTAGCT ATTTTTTTAT GCAKATATT AAAGTTGCAA GCAAGCAGAG 1587
CAACCCTCGG TTAATGGGCA AACATTCTCC TGGGGTAGAA TGAATTGTCT 1637
ATTTAGCCCG AAAACTGCAG TTTCTG\GGG TGGCTGCCAG ACTACAGCCG 1687
TGCTTTGCTC TGGCTTTGAC AGGTTGAAAT AGYCCCCATG ASCSTGGAAC 1737
AGWACTCCAG ACTGTGCTGG AGTCCCAAAG TTAGGAGGGC CATGGAGCCT 1787
GGGACAGGCT GCTGCTTTGG TCTTTAGGAT CTAGGAARAA TTACAGAGGG 1837
GCCAAGACAG AGTTCCCTCC CCTAGAAACT VTGCAGCCTG GAAGTCAGCC 1887
CTGGCACTTT AAGATAGCCT TCTTTAGAAC ATGAGTTAGT TGGTAGTATT 1937
CTGACGTGTA AACAGCCTAT KGTTGCTCGG AG^GGACCA TTTTCTCCAC 1987
TTCCCTGTCT GCATGCCTAA GACTTCTAGA GCAQCCAACG TATATGCAAC 2037
ATTAAAGAAA AAAAAAAAAA AAAAAAAAAA AAAAA 2072



SEQ ID NO: 5:
SEQUENCE LENGTH: 603
SEQUENCE TYPE: nucleic acid
TOPOLOGY: linear
MOLECULE TYPE: cDNA to mRNA
ORIGINAL SOURCE:
ORGANISM: Mus
FEATURE:
NAME/KEY: CDS
LOCATION: 1 .. 603
IDENTIFICATION METHOD: E
SEQUENCE DESCRIPTION: SEQ ID NO: 5

ATG AAG CCG TAC TTC TGC CAT GTC TTT GTC 30
Met Lys Pro Tyr Phe Cys His Val Phe Val
1               5                   10
TTC TGC TTC CTA ATC AGA CTT TTA ACA GGA 60
Phe Cys Phe Leu Ile Arg Leu Leu Thr Gly
                15                  20
GAA ATC AAT GGC TCG GCC GAT CAT AGG ATG 90
Glu Ile Asn Gly Ser Ala Asp His Arg Met
                25                  30
TTT TCA TTT CAC AAT Gi GGT GTA CAG ATT 120
Phe Ser Phe His Asn Gly Gly Val Gln Ile
                35                  40
TCT TGT AAA TAC CCT GAG ACT GTC CAG CAG 150
Ser Cys Lys Tyr Pro Glu Thr Val Gln Gln
                45                  50
TTA AAA ATG CGA TTG TTC AGA GAG AGA GAA 180
Leu Lys Met Arg Leu Phe Arg Glu Arg Glu
                55                  60
GTC CTC TGC GAA CTC ACC AAG ACI AAG GGA 210
Val Leu Cys Glu Leu Thr Lys Thr Lys Gly
                65                  70
AGC GGA AAT GCG GTG TCC ATC AAG AAT CCA 240
Ser Gly Asn Ala Val Ser Ile Lys Asn Pro
                75                  80
ATG CTC TGT CTA TAT CAT CTG TCA AAC AAC 270
Met Leu Cys Leu Tyr His Leu Ser Asn Asn
                85                  90
AGC GTC TCT TTT TTC CTA AAC AAC CCA GAC 300
Ser Val Ser Phe Phe Leu Asn Asn Pro Asp
                95                  100
AGC TCC CAG GGA AGC TAT TAC TTC TGC AGC 330
Ser Ser Gln Gly Ser Tyr Tyr Phe Cys Ser
                105                 110
CTG TCC ATT TTT GAC CCA CCT CCT TTT CAA 360
Leu Ser Ile Phe Asp Pro Pro Pro Phe Gln
                115                 120
GAA AGG AAC CTT AGT GGA GGA TAT TTG CAT 390
Glu Arg Asn Leu Ser Gly Gly Tyr Leu His
                125                 130
ATT TAT GAA TCC CAG CTC TGC TGC CAG CTG 420
Ile Tyr Glu Ser Gln Leu Cys Cys Gln Leu
                135                 140
AAG CTC TGG CTA CCC GTA GGG TTG CCA GCT 450
Lys Leu Trp Leu Pro Val Gly Leu Pro Ala
                145                 150
TTC GTT GTG GTA CTC CTT TTT GGA TGC ATA 480
Phe Val Val Val Leu Leu Phe Gly Cys Ile
                155                 160
CTT ATC ATC TGG TTT TCaW AAG AAA TAC 510
Leu Ile Ile Trp Phe Ser Lys Lys Lys Tyr
                165                 170
GGA TCC AGT GTG CAT GAC CCT AAT AGT GAA 540
Gly Ser Ser Val His Asp Pro Asn Ser Glu
                175                 180
TAC ATG TTC ATG GCG GCA GTC AAC ACA AAC 570
Tyr Met Phe Met Ala Ala Val Asn Thr Asn
                185                 190
AAA AAG TCT AGA CTT GCA GGT GTG ACC TCA 600
Lys Lys Ser Arg Leu Ala Gly Val Thr Ser
                195                 200
TAA 603

SEQ ID NO: 6:



SEQUENCE LENGTH: 836
SEQUENCE TYPE: nucleic acid
TOPOLOGY: linear
MOLECULE TYPE: other nucleic acids (cDNA containing 3'- and 5'-untranslated sequences)
ORIGINAL SOURCE:
ORGANISM: Rattus
FEATURE:
NAME/KEY: CDS
LOCATION: 35 .. 685
IDENTIFICATION METHOD: E
SEQUENCE DESCRIPTION: SEQ ID NO: 6

CTGGAGGGGA AGAGTGCAGC TGTTCCTGGC AGAC 34
ATG AAG CCC TAC TTC TCG TGC GTC TTT GTC 64
Met Lys Pro Tyr Phe Ser Cys Val Phe Val
1               5                   10
TTC TGC TTC CTA ATC AAA CTT TTA ACA GGA 94
Phe Cys Phe Leu Ile Lys Leu Leu Thr Gly
                15                  20
GAA CTC AAT GAC TTG GCC AAT CAC AGG ATG 124
Glu Leu Asn Asp Leu Ala Asn His Arg Met
                25                  30
TTT TCG TTT CAC GAT GGA GGT GTA CAG ATT 154
Phe Ser Phe His Asp Gly Gly Val Gln Ile
                35                  40
TCT TGT AAC TAC CCT GAG ACT GTC CAG CAG 184
Ser Cys Asn Tyr Pro Glu Thr Val Gln Gln
                45                  50
TTA AAA ATG CAG TTG TTC AAA GAC AGA GAA 214
Leu Lys Met Gln Leu Phe Lys Asp Arg Glu
                55                  60
GTC CTC TGC GAC CTC ACC AAG ACC AAG GGA 244
Val Leu Cys Asp Leu Thr Lys Thr Lys Gly
                65                  70
AGC GGA AAC ACC GTG TCC ATC AAG AAT CCG 274
Ser Gly Asn Thr Val Ser Ile Lys Asn Pro
                75                  80
ATG TCC TGT CCA TAT CAG CTG TCC AAC AAC 304
Met Ser Cys Pro Tyr Gln Leu Ser Asn Asn
                85                  90
AGT GTC TCT TTT TTC CTA GAC AAC GCA GAC 334
Ser Val Ser Phe Phe Leu Asp Asn Ala Asp
                95                  100
AGC TCC CAG GGC AGC TAC TTT TTA TGC AGC 364
Ser Ser Gln Gly Ser Tyr Phe Leu Cys Ser
                105                 110
CTG TCG ATT TTC GAC CCA CCC CCT TTT CAA 394
Leu Ser Ile Phe Asp Pro Pro Pro Phe Gln
                115                 120
GAA AAG AAC CTT AGT GGA GGA TAT TTG CTT 424
Glu Lys Asn Leu Ser Gly Gly Tyr Leu Leu
                125                 130
ATT TAT GAA TCC CAG CTT TGT TGC CAG CTG 454
Ile Tyr Glu Ser Gln Leu Cys Cys Gln Leu
                135                 140
AAG CTT TGG TTA CCC GTA GGG TGT GCA GCT 484
Lys Leu Trp Leu Pro Val Gly Cys Ala Ala
                145                 150
TTT GTG GCA GCG CTC CTT TTT GGA TGC ATA 514
Phe Val Ala Ala Leu Leu Phe Gly Cys Ile
                155                 160
TTT ATC GTC TGG TTT GCA AAA AAG AAG TAC 544
Phe Ile Val Trp Phe Ala Lys Lys Lys Tyr
                165                 170
AGA TCC AGT GTG CAC GAC CCT AAT AGC GAG 574
Arg Ser Ser Val His Asp Pro Asn Ser Glu
                175                 180
TAC ATG TTC ATG GCG GCA GTC AAC ACA AAC 604
Tyr Met Phe Met Ala Ala Val Asn Thr Asn
                185                 190
AAA AAG TCC AGA CTT GCA GGT ACA GCA C( 634
Lys Lys Ser Arg Leu Ala Gly Thr Ala Pro
                195                 200
CTT \MG GCT TTG GGG AGA GGA GAA CAC TCT 664
Leu Arg Ala Leu Gly Arg Gly Glu His Ser
                205                 210
TCA TGT CAA GAC CGG AAT TAA 685
Ser Cys Gln Asp Arg Asn
                215
TTTGTTTAfT TCTATTTTAA AAGAAAGACA TTTTTTCCCC TAAAGATAAT 735
TTTTGTATTT TTATGTGAAA GTCTGAATCT TCATTTTAAC TCGACTTATA 785
TACTCTGTGG TATATTAAAA ATAATGTTTG TGAAAAAAAA AAAAAAAAAA 835
A 836



SEQ ID NO: 7:
SEQUENCE LENGTH: 27
SEQUENCE TYPE: nucleic acid
TOPOLOGY: linear
MOLECULE TYPE: other nucleic acids (synthetic DNA)
FEATURE:
NAME/KEY: primer bind
LOCATION: 1 .. 27
IDENTIFICATION METHOD: E
SEQUENCE DESCRIPTION: SEQ ID NO: 7

CTGCTCGAGA TGAAGCCCTA CTTCTCG 27



SEQ ID NO: 8:
SEQUENCE LENGTH: 32
SEQUENCE TYPE: nucleic acid
TOPOLOGY: linear
MOLECULE TYPE: other nucleic acids (synthetic DNA)
FEATURE:
NAME/KEY: primer bind
LOCATION: 1 .. 32
IDENTIFICATION METHOD: E
SEQUENCE DESCRIPTION: SEQ ID NO: 8

ACCCTACGGG TAACGGATCC TTCAGCTGGC AA 32

SEQ ID NO: 9:
SEQUENCE LENGTH: 30
SEQUENCE TYPE: nucleic acid
TOPOLOGY: linear
MOLECULE TYPE: other nucleic acid (synthetic DNA)
FEATURE:
NAME/KEY: primer bind
LOCATION: 1 .. 30
IDENTIFICATION METHOD: E
SEQUENCE DESCRIPTION: SEQ ID NO: 9

TAACTGTTTC TCGAGAACAT GAAGTCAGGC 30



SEQ ID NO: 10:
SEQUENCE LENGTH: 30
SEQUENCE TYPE: nucleic acid
TOPOLOGY: linear
MOLECULE TYPE: other nucleic acid (synthetic DNA)
FEATURE:
NAME/KEY: primer bind
LOCATION: 1 .. 30
IDENTIFICATION METHOD: E
SEQUENCE DESCRIPTION: SEQ ID NO: 10

ATCCTATGGG TAACGGATCG TTCAGCTGGC 30



SEQ ID NO: 11:
SEQUENCE LENGTH: 35
SEQUENCE TYPE: nucleic acid
TOPOLOGY: linear
MOLECULE TYPE: other nucleic acid (synthetic DNA)
FEATURE:
NAME/KEY: primer bind
LOCATION: 1 .. 35
IDENTIFICATION METHOD: E
SEQUENCE DESCRIPTION: SEQ ID NO: 11

CGTGATATTG CTGAAGAGCT TGGCGGCGAA TGGGC 35

SEQ ID NO: 12:
SEQUENCE LENGTH: 34
SEQUENCE TYPE: nucleic acid
TOPOLOGY: linear
MOLECULE TYPE: other nucleic acid (synthetic DNA)
FEATURE:
NAME/KEY: primer bind
LOCATION: 1 .. 34
IDENTIFICATION METHOD: E
SEQUENCE DESCRIPTION: SEQ ID NO: 12

CATTCAAGTT TCAGGGAACT AGTCCATGCG TTTC 34